Zhile Ren | Research Statement
Author: Zhile Ren
Abstract
Figure 1: The COG descriptor encodes orientation-invariant gradient features for objects seen from different views.
I develop new representations and algorithms for three-dimensional (3D) scene understanding from cluttered indoor RGB-D images and outdoor video sequences. I introduce novel representations for 3D object detection systems that localize objects with cuboids and describe room layouts by Manhattan structures. Using view-invariant 3D features, I capture 3D style variations and design systems to detect small objects by modeling support surfaces. Finally, I develop cascaded prediction frameworks to model 3D contextual relationships and enable rapid understanding of scene properties including depth, motion, and segmentation.
Similar resources
Cascaded Scene Flow Prediction using Semantic Segmentation
Given two consecutive frames from a pair of stereo cameras, 3D scene flow methods simultaneously estimate the 3D geometry and motion of the observed scene. Many existing approaches use superpixels for regularization, but may predict inconsistent shapes and motions inside rigidly moving objects. We instead assume that scenes consist of foreground objects rigidly moving in front of a static backg...
Supplementary document: Transient Attributes for High-Level Understanding and Editing of Outdoor Scenes
Our Transient Attribute Database: selection of representative images (Section 1.1), screen captures of our crowdsourced annotation tasks (Section 1.2), comparison of our aggregated labels to meteorological data (Section 1.3), and correlation between attributes (Section 1.4). Our attribute recognition method: description of the image features and encoding methods used (Section 2). In addition, o...
3D Object Detection with Latent Support Surfaces
We develop a 3D object detection algorithm that uses latent support surfaces to capture contextual relationships in indoor scenes. Existing 3D representations for RGB-D images capture the local shape and appearance of object categories, but have limited power to represent objects with different visual styles. The detection of small objects is also challenging because the search space is very la...
Transient Attributes for High-Level Understanding and Editing of Outdoor Scenes
We live in a dynamic visual world where the appearance of scenes changes dramatically from hour to hour or season to season. In this work we study “transient scene attributes”: high-level properties that affect scene appearance, such as “snow”, “autumn”, “dusk”, and “fog”. We define 40 transient attributes and use crowdsourcing to annotate thousands of images from 101 webcams. We use this “transi...
Programmable DNA Delivery to Cells Using Bioreducible Layer-by-Layer (LbL) Polyelectrolyte Thin Films